Evaluating the robustness of source code plagiarism detection tools to pervasive plagiarism-hiding modifications
نویسندگان
چکیده
Source code plagiarism is a common occurrence in undergraduate computer science education. In order to identify such cases, many source detection tools have been proposed. A tool evaluates pairs of assignment submissions detect indications plagiarism. However, plagiarising student will commonly apply plagiarism-hiding modifications an attempt evade detection. Subsequently, prior work has implied that currently available are not robust the application pervasive modifications. this article, 11 evaluated for robustness against The with data sets simulated plagiarism, constructed representative students. results performed evaluations indicate which fine-grained transformations structure. Of tools, JPlag and Plaggie demonstrates greatest different types also graph-based specifically those compare programs as program dependence graphs, show potentially greater
منابع مشابه
Evaluating the Performance of LSA for Source-code Plagiarism Detection
Latent Semantic Analysis (LSA) is an intelligent information retrieval technique that uses mathematical algorithms for analyzing large corpora of text and revealing the underlying semantic information of documents. LSA is a highly parameterized statistical method, and its effectiveness is driven by the setting of its parameters which are adjusted based on the task to which it is applied. This p...
متن کاملNormalization based Stop-Word approach to Source Code Plagiarism Detection
This paper is a report of PES Institute of Technology’s participation in the Cross Language Detection of Source Code Reuse (CL-SOCO) task at FIRE 2015 [1]. We approach this task as text document plagiarism task, without considering formal programming language grammatical structure. We use normalization of commonly used identifiers to detect pair of programs which have the same objective. We als...
متن کاملEvaluating Robustness for 'IPCRESS': Surrey's Text Alignment for Plagiarism Detection
This paper briefly describes the approach taken to the subtask of Text Alignment in the Plagiarism Detection track at PAN 14. We have now reimplemented our PAN12 approach in a consistent programmatic manner, courtesy of secured research funding. PAN 14 offers us the first opportunity to evaluate the performance/consistency of this re-implementation. We present results from this re-implementatio...
متن کاملPlagiarism Detection in Computer Code
The ease with which computer code can be copied by students presents opportunities for plagiarism on programming projects. Presently, instructors are provided with few tools (if any) that assist them in detecting possible plagiarism in an assignment. Therefore, the only solution is a tedious and error prone by hand check of student submissions, looking for similarity between any two. In this th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Empirical Software Engineering
سال: 2021
ISSN: ['1382-3256', '1573-7616']
DOI: https://doi.org/10.1007/s10664-021-09990-4